智能论文笔记

Deep Active Visual Attention for Real-time Robot Motion Generation: Emergence of Tool-body Assimilation and Adaptive Tool-use

Hyogo Hiruma , Hiroshi Ito , Hiroki Mori , Tetsuya Ogata

分类：机器人 | 人工智能

2022-06-29

充分感知环境是机器人运动产生的关键因素。尽管引入深层视觉处理模型有助于扩展这种能力，但现有的方法缺乏积极修改感知内容的能力。人类在视觉认知过程中进行内部性能。本文通过提出一种新的机器人运动生成模型来解决问题，灵感来自人类的认知结构。该模型结合了一个由州驱动的主动自上而下的视觉注意模块，该模块获得了可以根据任务状态积极改变目标的注意事项。我们将这种注意力称为基于角色的注意力，因为获得的注意力集中在整个运动中共有连贯作用的目标。该模型经过了机器人工具使用任务的训练，在该任务中，基于角色的专注分别在对象拾取和对象拖动运动过程中将机器人抓手和工具视为相同的最终效果。这类似于一种称为工具体同化的生物学现象，其中一个人将处理工具视为身体的扩展。结果表明，模型的视觉感知的灵活性有所提高，即使为其提供了未经训练的工具或暴露于实验者的分心，也可以持续稳定的注意力和运动。

translated by 谷歌翻译

Contact-Rich Manipulation of a Flexible Object based on Deep Predictive Learning using Vision and Tactility

Hideyuki Ichiwara , Hiroshi Ito , Kenjiro Yamamoto , Hiroki Mori , Tetsuya Ogata

分类：机器人

2021-12-13

我们实现了接触的灵活物体操作，这很难单独使用视力控制。在解压缩任务中，我们选择作为验证任务，夹具抓住拉动器，它隐藏袋子状态，例如其背后的变形的方向和量，使得仅通过视觉获取信息来执行任务。此外，柔性织物袋状态在操作期间不断变化，因此机器人需要动态地响应变化。然而，所有袋子状态的适当机器人行为难以提前准备。为了解决这个问题，我们开发了一种模型，可以通过具有触觉的视觉的实时预测来执行接触的灵活性对象操纵。我们介绍了一种基于点的注意机制，用于提取图像特征，Softmax转换来提取预测运动，以及用于提取触觉特征的卷积神经网络。使用真正的机器人手臂的实验结果表明，我们的方法可以实现响应袋子变形的运动，同时减少拉链上的负荷。此外，与单独的视觉相比，使用触觉从56.7％提高到93.3％，展示了我们方法的有效性和高性能。

translated by 谷歌翻译

Gaussian Process Classification Bandits

Tatsuya Hayashi , Naoki Ito , Koji Tabata , Atsuyoshi Nakamura , Katsumasa Fujita , Yoshinori Harada , Tamiki Komatsuzaki

分类：机器学习

2022-12-26

Classification bandits are multi-armed bandit problems whose task is to classify a given set of arms into either positive or negative class depending on whether the rate of the arms with the expected reward of at least h is not less than w for given thresholds h and w. We study a special classification bandit problem in which arms correspond to points x in d-dimensional real space with expected rewards f(x) which are generated according to a Gaussian process prior. We develop a framework algorithm for the problem using various arm selection policies and propose policies called FCB and FTSV. We show a smaller sample complexity upper bound for FCB than that for the existing algorithm of the level set estimation, in which whether f(x) is at least h or not must be decided for every arm's x. Arm selection policies depending on an estimated rate of arms with rewards of at least h are also proposed and shown to improve empirical sample complexity. According to our experimental results, the rate-estimation versions of FCB and FTSV, together with that of the popular active learning policy that selects the point with the maximum variance, outperform other policies for synthetic functions, and the version of FTSV is also the best performer for our real-world dataset.

translated by 谷歌翻译

Influence of collaborative customer service by service robots and clerks in bakery stores

Yuki Okafuji , Sichao Song , Jun Baba , Yuichiro Yoshikawa , Hiroshi Ishiguro

分类：机器人

2022-12-20

In recent years, various service robots have been introduced in stores as recommendation systems. Previous studies attempted to increase the influence of these robots by improving their social acceptance and trust. However, when such service robots recommend a product to customers in real environments, the effect on the customers is influenced not only by the robot itself, but also by the social influence of the surrounding people such as store clerks. Therefore, leveraging the social influence of the clerks may increase the influence of the robots on the customers. Hence, we compared the influence of robots with and without collaborative customer service between the robots and clerks in two bakery stores. The experimental results showed that collaborative customer service increased the purchase rate of the recommended bread and improved the impression regarding the robot and store experience of the customers. Because the results also showed that the workload required for the clerks to collaborate with the robot was not high, this study suggests that all stores with service robots may show high effectiveness in introducing collaborative customer service.

translated by 谷歌翻译

Materials Discovery using Max K-Armed Bandit

Nobuaki Kikkawa , Hiroshi Ohno

分类： (统计)机器学习 | 机器学习

2022-12-16

Search algorithms for the bandit problems are applicable in materials discovery. However, the objectives of the conventional bandit problem are different from those of materials discovery. The conventional bandit problem aims to maximize the total rewards, whereas materials discovery aims to achieve breakthroughs in material properties. The max K-armed bandit (MKB) problem, which aims to acquire the single best reward, matches with the discovery tasks better than the conventional bandit. Thus, here, we propose a search algorithm for materials discovery based on the MKB problem using a pseudo-value of the upper confidence bound of expected improvement of the best reward. This approach is pseudo-guaranteed to be asymptotic oracles that do not depends on the time horizon. In addition, compared with other MKB algorithms, the proposed algorithm has only one hyperparameter, which is advantageous in materials discovery. We applied the proposed algorithm to synthetic problems and molecular-design demonstrations using a Monte Carlo tree search. According to the results, the proposed algorithm stably outperformed other bandit algorithms in the late stage of the search process when the optimal arm of the MKB could not be determined based on its expectation reward.

translated by 谷歌翻译

Best-Answer Prediction in Q&A Sites Using User Information

Rafik Hadfi , Ahmed Moustafa , Kai Yoshino , Takayuki Ito

分类：自然语言处理 | 人工智能 | 机器学习

2022-12-15

Community Question Answering (CQA) sites have spread and multiplied significantly in recent years. Sites like Reddit, Quora, and Stack Exchange are becoming popular amongst people interested in finding answers to diverse questions. One practical way of finding such answers is automatically predicting the best candidate given existing answers and comments. Many studies were conducted on answer prediction in CQA but with limited focus on using the background information of the questionnaires. We address this limitation using a novel method for predicting the best answers using the questioner's background information and other features, such as the textual content or the relationships with other participants. Our answer classification model was trained using the Stack Exchange dataset and validated using the Area Under the Curve (AUC) metric. The experimental results show that the proposed method complements previous methods by pointing out the importance of the relationships between users, particularly throughout the level of involvement in different communities on Stack Exchange. Furthermore, we point out that there is little overlap between user-relation information and the information represented by the shallow text features and the meta-features, such as time differences.

translated by 谷歌翻译

P2Net: A Post-Processing Network for Refining Semantic Segmentation of LiDAR Point Cloud based on Consistency of Consecutive Frames

Yutaka Momma , Weimin Wang , Edgar Simo-Serra , Satoshi Iizuka , Ryosuke Nakamura , Hiroshi Ishikawa

分类：计算机视觉 | 机器人

2022-12-01

We present a lightweight post-processing method to refine the semantic segmentation results of point cloud sequences. Most existing methods usually segment frame by frame and encounter the inherent ambiguity of the problem: based on a measurement in a single frame, labels are sometimes difficult to predict even for humans. To remedy this problem, we propose to explicitly train a network to refine these results predicted by an existing segmentation method. The network, which we call the P2Net, learns the consistency constraints between coincident points from consecutive frames after registration. We evaluate the proposed post-processing method both qualitatively and quantitatively on the SemanticKITTI dataset that consists of real outdoor scenes. The effectiveness of the proposed method is validated by comparing the results predicted by two representative networks with and without the refinement by the post-processing network. Specifically, qualitative visualization validates the key idea that labels of the points that are difficult to predict can be corrected with P2Net. Quantitatively, overall mIoU is improved from 10.5% to 11.7% for PointNet [1] and from 10.8% to 15.9% for PointNet++ [2].

translated by 谷歌翻译

jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus

Tomohiko Nakamura , Shinnosuke Takamichi , Naoko Tanji , Satoru Fukayama , Hiroshi Saruwatari

分类：机器学习

2022-11-29

We construct a corpus of Japanese a cappella vocal ensembles (jaCappella corpus) for vocal ensemble separation and synthesis. It consists of 35 copyright-cleared vocal ensemble songs and their audio recordings of individual voice parts. These songs were arranged from out-of-copyright Japanese children's songs and have six voice parts (lead vocal, soprano, alto, tenor, bass, and vocal percussion). They are divided into seven subsets, each of which features typical characteristics of a music genre such as jazz and enka. The variety in genre and voice part match vocal ensembles recently widespread in social media services such as YouTube, although the main targets of conventional vocal ensemble datasets are choral singing made up of soprano, alto, tenor, and bass. Experimental evaluation demonstrates that our corpus is a challenging resource for vocal ensemble separation. Our corpus is available on our project page (https://tomohikonakamura.github.io/jaCappella_corpus/).

translated by 谷歌翻译

Resilience of Wireless Ad Hoc Federated Learning against Model Poisoning Attacks

Naoya Tezuka , Hideya Ochiai , Yuwei Sun , Hiroshi Esaki

分类：机器学习

2022-11-07

Wireless ad hoc federated learning (WAFL) is a fully decentralized collaborative machine learning framework organized by opportunistically encountered mobile nodes. Compared to conventional federated learning, WAFL performs model training by weakly synchronizing the model parameters with others, and this shows great resilience to a poisoned model injected by an attacker. In this paper, we provide our theoretical analysis of the WAFL's resilience against model poisoning attacks, by formulating the force balance between the poisoned model and the legitimate model. According to our experiments, we confirmed that the nodes directly encountered the attacker has been somehow compromised to the poisoned model but other nodes have shown great resilience. More importantly, after the attacker has left the network, all the nodes have finally found stronger model parameters combined with the poisoned model. Most of the attack-experienced cases achieved higher accuracy than the no-attack-experienced cases.

translated by 谷歌翻译

Time series quantile regression using random forests

Hiroshi Shiraishi , Tomoshige Nakamura , Ryotato Shibuki

分类： (统计)机器学习

2022-11-04

We discuss an application of Generalized Random Forests (GRF) proposed by Athey et al.(2019) to quantile regression for time series data. We extracted the theoretical results of the GRF consistency for i.i.d. data to time series data. In particular, in the main theorem, based only on the general assumptions for time series data in Davis and Nielsen (2020), and trees in Athey et al.(2019), we show that the tsQRF (time series Quantile Regression Forests) estimator is consistent. Davis and Nielsen (2020) also discussed the estimation problem using Random Forests (RF) for time series data, but the construction procedure of the RF treated by the GRF is essentially different, and different ideas are used throughout the theoretical proof. In addition, a simulation and real data analysis were conducted.In the simulation, the accuracy of the conditional quantile estimation was evaluated under time series models. In the real data using the Nikkei Stock Average, our estimator is demonstrated to be more sensitive than the others in terms of volatility, thus preventing underestimation of risk.

translated by 谷歌翻译